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THREE -TRANSISTOR NAND AND NOR GATES FOR 
TWO -PHASE CLOCK GENERATORS 



FIELD OF THE INVENTION 

5 The present invention is generally related to 

precision signal sampling circuits, and more particularly 
to sampling signals sensitive to signal-to-noise ratios, 
such as two-phase clock generators used in analog 
circuits including high-speed analog-to-digital 
10 converters (ADC's). 



BACKGROUND OF THE INVENTION 

In high-speed ADC applications, as well as in every 
other instance when sampling of an analog signal is 

15 involved (i.e. optical receivers, data stream "slicers", 
etc.), the precision of the sampling instant impacts the 
signal-to-noise ratio (SNR) of the discrete- time signal 
fed into the system. Accordingly, the stability of the 
timing reference is of paramount importance. From the 12- 

20 bit level accuracy on, the thermal noise contribution to 
SNR has to be minimized to such an extent that a noise 
contribution coming from sampling jitter of Ips or higher 
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becomes the limiting factor for SNR - at least from 70MHz 
input frequency on. 

A very clean time reference (OCXO, or other 
crystal -based solution, further band-pass filtered) must 
5 be provided to the non- overlapped phase generation 
circuits. In turn, the on-chip circuits must provide a 
clean transition edge to the sampling device - usually a 
simple switch - driven through a carefully optimized, 
short path within the clock distribution tree. The 

10 thermal noise of the logic gates, and especially the 
voltage bounce of the supply rails, can significantly 
degrade the stability of the clock period, introducing 
perturbations on the time of occurrence of the sampling 
edge which are inversely proportional to the slope of the 

15 waveforms featured at every node. 



TI-36597 



SUMMARY OF THE INVENTION 



The present invention achieves technical advantages 
by allowing the circuit designer to simplify the NAND/NOR 
5 logic gates used in non- overlapped phase generation 
circuits, eliminating 1 transistor out of 4 that can be 
proven redundant in a synchronization application. The 
associated reduction in the capacitive self-loading seen 
at the gate output advantageously enables much steeper 
10 fronts on the output voltage, eventually ameliorating the 
jitter performance of the whole clock generator. 

The present invention may be incorporated in a novel 
clock structure implemented for a high speed (80MSps) 
high- input frequency (for use in receivers with up to 

15 225MHz IF) 14-bit analog- to-digital converter (ADC), 
lowering the jitter from 530fs down to 230fs. To date, 
this ranks as the lowest jitter CMOS-based clock ever 
implemented and tested in an ADC. In turn, the jitter 
performance boosts the SNR performance of an ADC from 

20 about 60dBFS at 80MSps, 220MHz input to as high as 67dBFS 
in the same conditions. 

The present invention advantageously exploits the 
timing sequence of the signals driving a NAND gate. The 
present invention recognizes that since the pull-up 
25 operation of a node B2 is dictated by a node Bl only, 
keeping two PMOSFET devices in the NAND gate is 
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redundant. In fact, the PMOSFET 4 6 driven by node A4 in 
Figure 3, which will be described in detail shortly, 
covers an input -to -output transition which is possible in 
principle, but that never happens in practice when the 
5 NAND is embedded in the classical two-phase clock loop of 
Figure 1. According to the present invention, the 
structure of the NAND gate 44 is simplified whereby the 
NAND gate essentially becomes an inverter driven by node 
Bl, whose pulldown operation is only conditioned to node 
10 A4 . In other words, the essence of the NAND logic is 
preserved through the series of two NMOSFETs, whereas the 
active pull-up can be guaranteed by utilizing only one 
PMOSFET. 

The NAND gate of the present invention improves the 
15 clock generation and distribution on-chip, reducing the 
jitter of internal clock networks, and simplifies the 
structure of the NAND CMOS gates built inside the 
two -phase generator, providing an enhanced solution right 
at the very root of the problem. The principles and 
20 advantages of the present invention is also applicable to 
a NOR gate. 

The present invention finds particular advantages in 
two-phase non-overlapped clock circuits of 
Analog- to-Digital data converters, as well as any circuit 
25 having a two-phase non-overlapped clock generator. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic of a conventional two-phase 
non-overlapping clock generating network, wherein a 
transition at the output node A4 is not aligned with the 
5 complementary transition at output node B4 ; 

Figure 2 is a timing diagram illustrating the 
principles of the circuit in Figure 1 based on inducing a 
transition at the output node A4 and which is not aligned 
with the complementary transition at node B4, forcing 3 
10 gate delays between them due to one NAND and two 
inverters gates; 

Figure 3 depicts a conventional NAND gate used in 
the application of Figure 1 using a well-known structure; 

Figure 4 illustrates an alternative implementation 
15 of the two -phase generation circuit depicted in Fig, 1 
where the principle of duality is applied; to show how 
NOR gates, being dual of NAND gates, can benefit from 
this invention. In fact the pull -down edge of node B2 is 
solely controlled by the rising edge of the input at node 
20 Bl; 

Figure 5 is a NAND gate accordingly to the present 
invention which recognizes only one PMOSFET is needed to 
perform the output pull-up, and that A4 can drive the 
NMOSFET whose source is grounded, eliminating the body 
25 effect on the NAND device and making the pull -down 
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transition the quickest possible; 

Figure 6 depicts a NOR gate according to the 
principles of the present invention whereby the NMOSFET 
controlled by node A4 is eliminated; 

Figures 7 depicts the increased pull-up edge slope 
using the NAND gate of the present invention; 

Figure 8 depicts the pull -down transient driven by 
A4 being improved according to the present invention; and 

Figure 9 depicts the increased speed of the circuit 
according to the present invention for the pull -down 
transient driven by Bl at the circuit's start-up, due to 
the reduced capacitive "ballasting" at the output. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

Referring to Figure 1, there is shown a prior art 
clock generator circuit 10, having a 2 input 6-gate loop 
5 network shown at 12 which performs non- overlapping of the 
two phases provided at inputs 14 and 16, and outputs a 
synchronization signal to a sampling device 18 at output 
20. An inverter tree 24 receives the respective output 
26 and 28 of the loop 12 and is a tapered clock 

10 distribution/edge regeneration circuitry. If the 

sampling device 18 receiving output 20 is modeled as a 
simple NMOSFET switch connected to a sampling capacitor 
for ease of discussion, the timing sequence in circuit 12 
shown at 32, 33, and 28 drives the HIGH >LOW edge that 

15 shuts down the sampling transistor 18 and constitutes the 
sampling event for the incoming signal. 

The operating principle of the loop network circuit 
12 is based on inducing a transition 34 at the output 
node A4 (26) of the non -overlapping network 12, which 

20 transition 34 is not aligned with the complementary 
transition 36 at output node B4 (28) of network 12. This 
non-alignment is due to triggering the falling edge 34 of 
node A4 on the rising edge of transition 36 of node B4 , 
thus forcing 3 gate delays (1 NAND + 2 inverters) between 

25 the transition fronts, as further illustrated in the 
timing diagram 40 of Figure 2. For sake of clarity, the 
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NAND gate 44 of circuit 12 is shown where nodes are 
identified by "B" letters. Since the input 15 at node Al 
of the NAND gate 42 of circuit 12 has already switched to 
the HIGH state synchronously with the node Bl falling 
5 edge, the state of NAND gate 44 is exclusively controlled 
by the output at node B4 . 

The rising edge of signal 33 at node B2 is instead 
synchronized to the falling edge of the input clock at 
node Bl. In fact, since the self -consistency of the loop 

10 12 forces the rising edge at node B2 to occur only when 
node A4 is HIGH, the input at node Bl is not "gated" by 
the NAND gate 44, which is transparent to it and acts as 
a simple inverter. On the other hand, the pull -down 
operation in the NAND gate 44 is only commanded by the 

15 other input, connected to node A4 . 

The standard, prior-art NAND gate 44 used for this 
application has the well-known structure as depicted in 
Figure 3, and includes 2 large PMOSFET's including 
PMOSFET 46. This configuration ensures coverage of all 
20 the states and output transitions of the digital "truth 
table" of the NAND gate 44, in every possible 
combination/sequence of the two inputs. 

A dual solution used for the generation of 
non-overlapped phase clock makes use of a NOR gate 48 
25 (and 49) in place of the NAND gate 42 and 44, and is 
shown at 47 in Figure 4. The whole timing diagram 40 in 
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Figure 2 is thus reversed: e.g., the NOR gate 49 pull-up 
front at node B2 is commanded by the output of the other 
side of the digital loop at node A4 of Figure 4, while 
the pull -down edge at node B2 is solely controlled by the 
5 rising edge of the input at node Bl . This configuration 
47 can benefit as well from the present invention, 
through direct application of the principle of duality, 
as will now be discussed in considerable detail. 

A NAND gate of the present invention is shown in 

10 Figure 5 at 50, which eliminates the one large PMOSFET 46 
shown in Figure 3. Since the pull-up operation of the B2 
node is dictated by Bl only, keeping two PMOSFET devices 
in the NAND gate is redundant. In fact, the PMOSFET 
driven by A4 in Fig. 3 covers an input >output 

15 transition which is possible in principle, but that never 
happens in practice when the NAND is embedded in the 
classical two-phase clock loop of Fig. 1. The structure 
of the NAND gate can be simplified, becoming an inverter 
driven by Bl, only, whose pull -down operation is 

20 conditioned to A4 . Since node A4 features a rising edge 
when node Bl is in the HIGH state, the present invention 
advantageously recognizes node A4 can be connected to 
NMOSFET 52 whose source is grounded to DVSS, rather than 
to NMOSFET 54. The alternative connection (A4 driving 

25 NMOSFET 54) could perturb output 60 via feedthrough 
charge injection. Besides avoiding this perturbation, the 
proposed connection eliminates the body effect on the 
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NAND 50, thus making the pull -down transition the 
quickest possible. Of course, the NMOSFET 54 controlled 
by node Bl is necessarily kept in this topology, since 
the input at node A4 is HIGH when the falling edge of the 
5 transition at node Bl occurs. Without the presence of 
the NMOSFET 54 controlled by node Bl, a DVDD-DVSS short 
with considerable "crowbar" current would engender. 

The simplified NAND gate 50 according to the present 
invention advantageously makes use of dynamic charge 

10 storage on the gates of the next inverter 17 and 19 
(shown in Figure 1) for a limited interval of time (when 
node A4 is LOW and node Bl is HIGH, i.e. immediately 
after the transition of the input clock and for a time 
span on the order of T^on - overlap ) where in that condition 

15 the output of NAND 50 remains effectively insulated. The 
NAND gate 50 does not entail any major limitation on a 
lower clock frequency of operation, since the charge must 
be stored in the output node 60 for intervals of about 
300-400ps, and sometimes less, independent on the 

20 sampling rate. Clock rates as low as 300kHz have been 
successfully tested with the new NAND gate 50 
configuration of the present invention. 

The straightforward application of the duality 
principle to the present invention with regard to the 
25 NAND gate 50 also is applicable to a NOR gate 70 shown in 
Figure 6. The device which can be eliminated from a 
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prior art NOR gate is a NMOSFET (shown hyphenated at 74) , 
in parallel to NMOSFET 72 and controlled by node A4 . It 
is apparent how the need for two PMOSFETs in series 
actually remains, still forcing the designer to employ 
bulky devices with less efficient channel mobility, and 
increased size to counteract the cascade effect. Thus, 
NOR gate 70 provides less of an advantage than NAND gate 
50 since the device which gets eliminated in NOR gate 70 
is the more effective NMOSFET transistor 74 in parallel 
to NMOSFET 72 in prior art realization. 

The simulated evidence of the beneficial application 
of the invention to the clock generator circuit of Figure 
1 is shown in Figures 7 to 9 for the NAND gate 50. 
Figure 7 depicts the faster transient behavior of the 
15 present invention at 80, as compared to a transient 82 
using conventional NAND 44, simulated for the pull-up 
transition triggered by a falling edge of node Bl in 
Figure 1. This is the most important transition occurring 
in the schematic 10 of Figure 1, namely, the falling edge 
20 of the input at node Bl controls the operation of the 
sampling device 18, opening the NMOSFET. In order to 
drive the sampling device 18 with as clean a signal as 
possible, the best choice is to derive the sampling front 
from the input clock through as few gates as possible, 
25 from node Bl to node B3 . The triggering event cannot come 
instead from the complementary branch, since it would 
have to travel through so many gates . 
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In the Spice simulation, the load of the NAND gate 
50 is an inverter 19 whose total size equals the size of 
the NMOS-PMOS pair 52, 54 of the NAND 50, with a 
threshold centered to mid-rail. Even with the same loads, 
5 the absence of a bulky PMOSFET shown in the prior art 
NAND gate 44 (shown in Figure 3) leads to a striking 
improvement in the steepness of the transition, as shown 
at 80. The slope of waveform 80 taken at the 1.65V 
threshold (3.3V supply) increases from 67.6V/jas to 
10 96.9V/jas, thus improves 43%, as shown in Figure 7. The 
advantageous enhancement can be fully capitalized into 
jitter reduction, provided all the other sources of 
instability (supply bounce, crosstalk, thermal noise of 
the other inverters/gates) have been properly minimized. 

15 The other most important transition after the 

falling edge at node Bl is the pull-down transient, 
governed by node A4 . Although the transient 84 of Figure 
8 could be theoretically used to drive a sampling event, 
it is not practical for very low jitter applications, 

20 such as a 14 -bit ADC converter for wireless applications, 
for the reasons explained above. Figure 8 shows the 
falling edge at the NAND 50 output 60 once the input 
source (node Bl) has gone high, and node A4 features a 
rising edge mandated by the opposite branch of the 

25 circuit 12. The overshoot 86 from the input is much 
reduced due to the lack of feedthrough from the large 
PMOSFET 46 which was connected to node A4 in the prior 
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art NAND 44 shown in Figure 3, which remarkably affects, 
instead, such a solution. The effect is exacerbated by 
the extremely steep input edge (lps) adopted in 
simulation. The NAND gate 50 of present invention 
5 prevents a detrimental overshoot altogether, as shown, 

since the other transition (node Bl LOW >HIGH) happens 

when the sampling transistor 18 is fully conducting, 
hence cannot be perturbed by transients superimposed to 
the 3.3V level. The pull-down is completed by node A4 
10 going HIGH, which only injects charge at node 56 between 
the two cascoded NMOSFETs 52 and 54 without affecting the 
output 60, as shown in Figure 8 (see 84) . Since node A4 
is advantageously set to drive the FET transistor 52 with 
grounded source, no body effect hampers the transient 84. 

15 The new NAND gate 50 still demonstrates a clear 

advantage over the prior art NAND 44. Since the NAND gate 
50 has been designed to feature symmetrical in/out 
characteristic and pull up/down behavior, the pull -down 
slopes of NAND gate 50 and the prior art NAND gate 44 are 

20 almost identical to the ones previously detected during 
pull-up: 95.7V/ (as against 66.6V/|as, or a 44% progress. 

Finally, a transition never occurring during the 
normal operation of the circuit using NAND 50, but that 
may occur during the initial transient, is the output 
25 pull -down triggered by the input clock, or node Bl going 
HIGH. Although the overshoot caused by the feed- through 
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from the input is present in this case, and is even 
slightly worse due to the reduced capacitive "ballasting" 
at the output 60, the speed of the NAND gate 50 still 
largely ameliorates the prior art: the slopes detected at 

5 90 and 92 are respectively 98.8V/ jus versus 66.7V/ (is, or a 
4 8% enhancement, as shown in Figure 9. 

Despite the NAND gate 50 lacks some of the otherwise 
allowed transitions on the truth table (i.e. node A4 
cannot command a rising edge at the output) the NAND gate 
10 50 does not cause any metastable states in the circuit 10 
of Figure 1. In fact, any "latched" voltage configuration 
is resolved at the next edge after half a period, hence, 
the network 12 comes out of metastability within 1 clock 
period - worst case. 

15 The NAND gate 50 is superior to any passive-load 

gate in terms of pull-up capability and static power 
consumption (zeroed) , in the same way as every CMOS 
implementation is superior to all NMOS schemes. The 
preservation of the active nature of the NAND logic gate 

20 50, not resorting to passive elements but always having a 
transistor driving the output, constitutes another 
advantage . 

The NAND gate 50 is superior to the classical CMOS 
implementation gate 44 in that it allows to reduce the 
25 self-loading at the gate output 60, saving dynamic power 
consumption and area in the gate itself and in all the 
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previous inverters driving it, in a tapered "domino 
effect". Plain CMOS solutions 44 have been compared 
ceteris paribus to the NAND gate 50 in simulation and 
show voltage transients prone to generating jitter in the 
5 clock circuit designed on chip, unlike NAND gate 50. 

One main advantage of the present invention is the 
enhancement in the slope of the output transitions, both 

LOW > HIGH and HIGH > LOW, as demonstrated in Figures 

7 and 8. The advantage comes from the NAND gate 50 

10 having the same driving devices, but one less idle 
transistor 46 connected to the output 60. In the NAND 
gate 50, since the device eliminated is a PMOSFET 46, 
whose size exceeds the NMOS counterparts 52, 54 by a 
factor 2.5 to 3 to ensure symmetrical transfer function 

15 and optimize the noise margins, the gain in steepness is 
particularly dramatic. Since the RMS timing uncertainty 
of the clock period (a T ) generated by the circuit 12 
using NAND gate 50 is related to the RMS voltage noise at 
the output of each logic gate by dividing it by the slope 

20 of the waveform, such an advancement directly impacts the 
jitter performance of the clock. In a system like an ADC, 
or, more in general, for every application requiring a 
Sample/Hold stage, in turn this translates into the 
aperture uncertainty specification. 

25 For example, in one conventional 14 -bit 80MSps ADC 

which adopts a prior art NAND gate 44 shown in Figure 1, 
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the utilization of the NAND gate 50 advantageously 
contributes to a jitter figure improvement from 530fs 
down to 230fs. With IF (Intermediate Frequencies) of the 
transceiver chains being pushed higher than 2 00MHz, and 
5 resolutions of 12-14bits, the jitter contribution becomes 
the dominant one, and such a reduction entails more than 
6dB of difference in SNR and possibly in SINAD, hence >1 
bit in ENOB. 

The final inverter driving node A4 now only sees the 
NMOSFET 52 of the NAND gate 50, and not PMOSFET 46, and 
can be sized smaller than when driving prior art NAND 44. 
In turn, given the customary adoption of an exponential 
tapering law for the inverters inside the loop 12 of 
circuit 10 of Figure 1 and the ensuing buffer chain, a 
load reduction in the inverter driving node A4 reflects 
into a reduction in the inverter driving node A3, and in 
turn allows for a smaller NAND sizing to begin with. 
Besides optimizing jitter, the present invention thus 
enables a lower power consumption for the clock circuit 
10 as a whole. 

A straightforward advantage coming from the 
reduction in the transistor count is the decrease in area 
and wiring layout complexity. 

As shown in Figures 7-9, the charge injection of 
25 NAND gate 50 into the output 60 is either the same as in 
prior art (pull-up transition), or, completely canceled 
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out (pull-down) . In fact, either the PMOS device governed 
by node Bl executes the pull-up, and the transistor 
configuration is same as the standard gate, or, the NMOS 
governed by node A4 executes the pull -down, which is not 
5 directly tied to the output 60 and allows for a transient 
without feedthrough. Furthermore, to start from the rail 
voltage, and not above it, results eventually in a faster 
transition. 

As a corollary, the lack of signal feed-through 
10 taking the output voltages beyond the rails during 
certain transitions implies less stressing of the driven 
inverter gates, or, better compliance with reliability 
guidelines . 

It has been mentioned that the new gates dynamically 
15 store charge. As opposite to the host of dynamic logics 
that are based onto this principle (NORA, Domino logics) , 
the present invention a) does not employ any form of 
clocked precharge; and b) does not require the charge to 
be held for half a cycle (as shown for prior art NAND 44 
20 in Figure 2) : T nonoV eriap << T cyc i e /2 is the required storage 
time . 

The present invention finds particular advantages in 
clock circuits of Analog-to-Digital data converters as 
well as any circuit having a two-phase non-overlapped 
25 clock generator. 
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Though the invention has been described with respect 
to a specific preferred embodiment, many variations and 
modifications will become apparent to those skilled in 
the art upon reading the present application. It is 
5 therefore the intention that the appended claims be 
interpreted as broadly as possible in ^iew of the prior 
art to include all such variations and modifications. 
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